Japanese Emotion Corpus Analysis and its Usefor Automatic Emotion Word Identification

نویسندگان

  • Junko Minato
  • David B. Bracewell
  • Fuji Ren
  • Shingo Kuroiwa
چکیده

In this paper, the creation of a Japanese emotion corpus and its use in automatic emotion word identification are examined. The corpus was created by manually tagging words in just under 1,200 dialog sentences with emotion. Using the tagged corpus, statistical analysis was performed to determine the characteristics of emotional expression in Japanese dialog. This type of analysis should prove beneficial for understanding how emotion is expressed and how to identify, classify, etc. emotion in Japanese. To test this theory an automatic emotion word identification system was built using machine learning based classifiers with features taken from the statistical analysis. In total, four different classifiers were trained and compared to a baseline dictionary approach. It was found that classifier based identification was able to significantly increase recall.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatically Annotating A Five-Billion-Word Corpus of Japanese Blogs for Affect and Sentiment Analysis

This paper presents our research on automatic annotation of a five-billion-word corpus of Japanese blogs with information on affect and sentiment. We first perform a study in emotion blog corpora to discover that there has been no large scale emotion corpus available for the Japanese language. We choose the largest blog corpus for the language and annotate it with the use of two systems for aff...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

An Exploration of Features for Recognizing Word Emotion

Emotion words have been well used as the most obvious choice as feature in the task of textual emotion recognition and automatic emotion lexicon construction. In this work, we explore features for recognizing word emotion. Based on RenCECps (an annotated emotion corpus) and MaxEnt (Maximum entropy) model, several contextual features and their combination have been experimented. Then PLSA (proba...

متن کامل

Automatic Annotation of Word Emotion in Sentences Based on Ren-CECps

Textual information is an important communication medium contained rich expression of emotion, and emotion recognition on text has wide applications. Word emotion analysis is fundamental in the problem of textual emotion recognition. Through an analysis of the characteristics of word emotion expression, we use word emotion vector to describe the combined basic emotions in a word, which can be u...

متن کامل

Construction of a Blog Emotion Corpus for Chinese Emotional Expression Analysis

There is plenty of evidence that emotion analysis has many valuable applications. In this study a blog emotion corpus is constructed for Chinese emotional expression analysis. This corpus contains manual annotation of eight emotional categories (expect, joy, love, surprise, anxiety, sorrow, angry and hate), emotion intensity, emotion holder/target, emotional word/phrase, degree word, negative w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Engineering Letters

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2008